Multidimensional scaling of combinatorial libraries without explicit enumeration
نویسندگان
چکیده
A novel approach for the multidimensional scaling of large combinatorial libraries is presented. The method employs a multilayer perceptron, which is trained to predict the coordinates of the products on the nonlinear map from pertinent features of their respective building blocks. This method limits the expensive enumeration and descriptor generation to only a small fraction of products and, in addition, relieves the enormous computational effort required for the low-dimensional embedding by conventional iterative multidimensional scaling algorithms. In effect, the method provides an explicit mapping function from reagents to products, and allows the vast majority of compounds to be projected without constructing their connection tables. The advantages of this approach are demonstrated using two combinatorial libraries based on the reductive amination and Ugi reactions, and three descriptor sets that are commonly used in similarity searching, diversity profiling and structure–activity correlation. © 2001 John Wiley & Sons, Inc. J Comput Chem 22: 1712–1722, 2001
منابع مشابه
Multidimensional scaling and visualization of large molecular similarity tables
Multidimensional scaling (MDS) is a collection of statistical techniques that attempt to embed a set of patterns described by means of a dissimilarity matrix into a low-dimensional display plane in a way that preserves their original pairwise interrelationships as closely as possible. Unfortunately, current MDS algorithms are notoriously slow, and their use is limited to small data sets. In thi...
متن کاملThe Statistical Mechanics of Vesicles
I discuss two-dimensional lattice models of closed fluctuating membranes, or vesicles (11. The underlying mathematical model is that of self-avoiding polygons and their enumeration by perimeter and area.. Physically, this model is of interest as its phase diagram contains a point at which tricritical scaling is observed [2]. By adding the constraint of partial directedness, one gets combinatori...
متن کاملStochastic Similarity Selections from Large Combinatorial Libraries
A stochastic procedure for similarity searching in large virtual combinatorial libraries is presented. The method avoids explicit enumeration and calculation of descriptors for every virtual compound, yet provides an optimal or nearly optimal similarity selection in a reasonable time frame. It is based on the principle of probability sampling and the recognition that each reagent is represented...
متن کاملTemplate-based combinatorial enumeration of virtual compound libraries for lipids
A variety of software packages are available for the combinatorial enumeration of virtual libraries for small molecules, starting from specifications of core scaffolds with attachments points and lists of R-groups as SMILES or SD files. Although SD files include atomic coordinates for core scaffolds and R-groups, it is not possible to control 2-dimensional (2D) layout of the enumerated structur...
متن کاملHalf-Century Journey from Synthetic Organic Chemistry to Mathematical Stereochemistry through Chemoinformatics
My half-century journey started from synthetic organic chemistry. During the first stage of my journey, my interest in stereochemistry was initiated through the investigation on the participation of steric effects in reactive intermediates, cylophanes, strained heterocycles, and organic compounds for photography. In chemoinformatics as the next stage of the journey, I proposed the concept of im...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Journal of Computational Chemistry
دوره 22 شماره
صفحات -
تاریخ انتشار 2001